Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 14958 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.4 MiB |
| Average record size in memory | 168.0 B |
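The two memory figures above can be cross-checked against the observation count. The sketch below is only an arithmetic check on the reported numbers, assuming 1 MiB means 2**20 bytes and that the total is the average record size times the number of rows.

```python
# Values copied from the "Dataset statistics" table above.
n_observations = 14958
avg_record_bytes = 168.0

# Assumption: 1 MiB = 2**20 bytes.
total_bytes = n_observations * avg_record_bytes
total_mib = total_bytes / 2**20

print(f"{total_mib:.1f} MiB")
```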
Variable types
| NUM | 12 |
|---|---|
| CAT | 1 |
| DATE | 1 |
Reproduction
| Analysis started | 2021-05-09 08:11:59.729883 |
|---|---|
| Analysis finished | 2021-05-09 08:12:33.222763 |
| Duration | 33.49 seconds |
| Version | pandas-profiling v2.7.1 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
Warnings
| Warning | Type |
|---|---|
| bluecars_returned_sum is highly correlated with bluecars_taken_sum and 4 other fields | High correlation |
| bluecars_taken_sum is highly correlated with bluecars_returned_sum and 4 other fields | High correlation |
| utilib_returned_sum is highly correlated with utilib_taken_sum | High correlation |
| utilib_taken_sum is highly correlated with utilib_returned_sum | High correlation |
| utilib_14_taken_sum is highly correlated with bluecars_taken_sum and 2 other fields | High correlation |
| utilib_14_returned_sum is highly correlated with bluecars_taken_sum and 2 other fields | High correlation |
| slots_freed_sum is highly correlated with bluecars_taken_sum and 2 other fields | High correlation |
| slots_taken_sum is highly correlated with bluecars_taken_sum and 2 other fields | High correlation |
| df_index is uniformly distributed | Uniform |
| df_index has unique values | Unique |
| dayofweek has 2374 (15.9%) zeros | Zeros |
| utilib_taken_sum has 4972 (33.2%) zeros | Zeros |
| utilib_returned_sum has 4909 (32.8%) zeros | Zeros |
| utilib_14_taken_sum has 2605 (17.4%) zeros | Zeros |
| utilib_14_returned_sum has 2568 (17.2%) zeros | Zeros |
| slots_freed_sum has 9492 (63.5%) zeros | Zeros |
| slots_taken_sum has 9499 (63.5%) zeros | Zeros |
df_index
Real number (ℝ≥0)
| Distinct count | 14958 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8046.0734723893565 |
|---|---|
| Minimum | 0 |
| Maximum | 16083 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 117.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 804.7 |
| Q1 | 4030.25 |
| median | 8050.5 |
| Q3 | 12063.75 |
| 95-th percentile | 15280.15 |
| Maximum | 16083 |
| Range | 16083 |
| Interquartile range (IQR) | 8033.5 |
Descriptive statistics
| Standard deviation | 4642.145244 |
|---|---|
| Coefficient of variation (CV) | 0.5769454206 |
| Kurtosis | -1.199174841 |
| Mean | 8046.073472 |
| Median Absolute Deviation (MAD) | 4017 |
| Skewness | -0.00150783443 |
| Sum | 120353167 |
| Variance | 21549512.46 |
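Several of the figures in the quantile and descriptive tables above are derived from others, so they can be checked against each other. The following is a minimal sketch of that arithmetic using only the values reported above; it does not recompute anything from the underlying data.

```python
# Values copied from the quantile and descriptive statistics tables above.
q1, q3 = 4030.25, 12063.75
minimum, maximum = 0, 16083
mean = 8046.073472
std = 4642.145244

iqr = q3 - q1                    # interquartile range: Q3 minus Q1
value_range = maximum - minimum  # range: maximum minus minimum
cv = std / mean                  # coefficient of variation: std relative to the mean
variance = std ** 2              # variance: squared standard deviation

print(iqr, value_range, round(cv, 6), round(variance, 2))
```

The results agree with the reported IQR, range, CV and variance up to the rounding of the inputs.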
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 2047 | 1 | < 0.1% | |
| 6822 | 1 | < 0.1% | |
| 15042 | 1 | < 0.1% | |
| 8897 | 1 | < 0.1% | |
| 10944 | 1 | < 0.1% | |
| 4791 | 1 | < 0.1% | |
| 6838 | 1 | < 0.1% | |
| 693 | 1 | < 0.1% | |
| 2740 | 1 | < 0.1% | |
| 12979 | 1 | < 0.1% | |
| Other values (14948) | 14948 | 99.9% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 16083 | 1 | < 0.1% | |
| 16082 | 1 | < 0.1% | |
| 16081 | 1 | < 0.1% | |
| 16080 | 1 | < 0.1% | |
| 16079 | 1 | < 0.1% |
postal_code
Real number (ℝ≥0)
| Distinct count | 104 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 88800.6020189865 |
|---|---|
| Minimum | 75001 |
| Maximum | 95880 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 117.0 KiB |
Quantile statistics
| Minimum | 75001 |
|---|---|
| 5-th percentile | 75006 |
| Q1 | 91330 |
| median | 92340 |
| Q3 | 93400 |
| 95-th percentile | 94500 |
| Maximum | 95880 |
| Range | 20879 |
| Interquartile range (IQR) | 2070 |
Descriptive statistics
| Standard deviation | 7641.51636 |
|---|---|
| Coefficient of variation (CV) | 0.08605252877 |
| Kurtosis | -0.5342936433 |
| Mean | 88800.60202 |
| Median Absolute Deviation (MAD) | 1030 |
| Skewness | -1.172012811 |
| Sum | 1328279405 |
| Variance | 58392772.28 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 94130 | 145 | 1.0% | |
| 92190 | 145 | 1.0% | |
| 94300 | 145 | 1.0% | |
| 94340 | 145 | 1.0% | |
| 94500 | 145 | 1.0% | |
| 78140 | 145 | 1.0% | |
| 94700 | 145 | 1.0% | |
| 95100 | 145 | 1.0% | |
| 75006 | 145 | 1.0% | |
| 75014 | 145 | 1.0% | |
| Other values (94) | 13508 | 90.3% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 75001 | 145 | 1.0% | |
| 75002 | 145 | 1.0% | |
| 75003 | 145 | 1.0% | |
| 75004 | 145 | 1.0% | |
| 75005 | 145 | 1.0% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 95880 | 145 | 1.0% | |
| 95870 | 145 | 1.0% | |
| 95100 | 145 | 1.0% | |
| 94800 | 145 | 1.0% | |
| 94700 | 145 | 1.0% |
date
Date
| Distinct count | 145 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 117.0 KiB |
| Minimum | 2018-01-01 00:00:00 |
|---|---|
| Maximum | 2018-06-18 00:00:00 |
Histogram
daily_data_points
Real number (ℝ≥0)
| Distinct count | 12 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1438.563043187592 |
|---|---|
| Minimum | 1411 |
| Maximum | 1440 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 117.0 KiB |
Quantile statistics
| Minimum | 1411 |
|---|---|
| 5-th percentile | 1434 |
| Q1 | 1439 |
| median | 1440 |
| Q3 | 1440 |
| 95-th percentile | 1440 |
| Maximum | 1440 |
| Range | 29 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 4.378831957 |
|---|---|
| Coefficient of variation (CV) | 0.003043892986 |
| Kurtosis | 19.7081417 |
| Mean | 1438.563043 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -4.385874416 |
| Sum | 21518026 |
| Variance | 19.1741693 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1440 | 10109 | 67.6% | |
| 1439 | 2578 | 17.2% | |
| 1438 | 721 | 4.8% | |
| 1437 | 411 | 2.7% | |
| 1434 | 207 | 1.4% | |
| 1425 | 207 | 1.4% | |
| 1417 | 206 | 1.4% | |
| 1429 | 104 | 0.7% | |
| 1436 | 104 | 0.7% | |
| 1435 | 104 | 0.7% | |
| Other values (2) | 207 | 1.4% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1411 | 104 | 0.7% | |
| 1417 | 206 | 1.4% | |
| 1420 | 103 | 0.7% | |
| 1425 | 207 | 1.4% | |
| 1429 | 104 | 0.7% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1440 | 10109 | 67.6% | |
| 1439 | 2578 | 17.2% | |
| 1438 | 721 | 4.8% | |
| 1437 | 411 | 2.7% | |
| 1436 | 104 | 0.7% |
dayofweek
Real number (ℝ≥0)
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.9381601818424925 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 2374 |
| Zeros (%) | 15.9% |
| Memory size | 117.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.042206884 |
|---|---|
| Coefficient of variation (CV) | 0.6950631543 |
| Kurtosis | -1.304166968 |
| Mean | 2.938160182 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.03519677094 |
| Sum | 43949 |
| Variance | 4.170608957 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2374 | 15.9% | |
| 1 | 2269 | 15.2% | |
| 6 | 2169 | 14.5% | |
| 4 | 2168 | 14.5% | |
| 2 | 2062 | 13.8% | |
| 5 | 2061 | 13.8% | |
| 3 | 1855 | 12.4% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2374 | 15.9% | |
| 1 | 2269 | 15.2% | |
| 2 | 2062 | 13.8% | |
| 3 | 1855 | 12.4% | |
| 4 | 2168 | 14.5% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 6 | 2169 | 14.5% | |
| 5 | 2061 | 13.8% | |
| 4 | 2168 | 14.5% | |
| 3 | 1855 | 12.4% | |
| 2 | 2062 | 13.8% |
day_type
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 117.0 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| weekday | 10728 | 71.7% | |
| weekend | 4230 | 28.3% |
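The weekday and weekend counts above are consistent with the dayofweek frequencies reported elsewhere in this report, if day_type is derived from dayofweek. The sketch below assumes that rule (values 5 and 6 are the weekend, with 0 meaning Monday as in Python's `date.weekday()`); the helper `day_type` is hypothetical and only illustrates the mapping.

```python
from datetime import date

def day_type(d: date) -> str:
    """Label a date 'weekend' for Saturday/Sunday, else 'weekday' (assumed rule)."""
    return "weekend" if d.weekday() >= 5 else "weekday"

# Counts per dayofweek value, copied from the dayofweek frequency table in this report.
dayofweek_counts = {0: 2374, 1: 2269, 2: 2062, 3: 1855, 4: 2168, 5: 2061, 6: 2169}

weekday_total = sum(c for d, c in dayofweek_counts.items() if d < 5)
weekend_total = sum(c for d, c in dayofweek_counts.items() if d >= 5)

# 2018-01-01 is the first date in the data set; 2018-01-06 appears as a weekend row.
print(date(2018, 1, 1).weekday(), day_type(date(2018, 1, 6)))
print(weekday_total, weekend_total)
```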
Length
| Max length | 7 |
|---|---|
| Mean length | 7 |
| Min length | 7 |
Unicode categories
| Value | Count | Frequency (%) |
|---|---|---|
| Lowercase_Letter | 7 | 100.0% |
Scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 7 | 100.0% |
Blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 7 | 100.0% |
bluecars_taken_sum
Real number (ℝ≥0)
| Distinct count | 921 |
|---|---|
| Unique (%) | 6.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 127.36889958550609 |
|---|---|
| Minimum | 0 |
| Maximum | 1255 |
| Zeros | 42 |
| Zeros (%) | 0.3% |
| Memory size | 117.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 20 |
| median | 47 |
| Q3 | 138 |
| 95-th percentile | 529 |
| Maximum | 1255 |
| Range | 1255 |
| Interquartile range (IQR) | 118 |
Descriptive statistics
| Standard deviation | 185.2157701 |
|---|---|
| Coefficient of variation (CV) | 1.454167938 |
| Kurtosis | 5.681815338 |
| Mean | 127.3688996 |
| Median Absolute Deviation (MAD) | 35 |
| Skewness | 2.346092262 |
| Sum | 1905184 |
| Variance | 34304.88149 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 12 | 235 | 1.6% | |
| 11 | 232 | 1.6% | |
| 14 | 231 | 1.5% | |
| 9 | 225 | 1.5% | |
| 10 | 221 | 1.5% | |
| 13 | 218 | 1.5% | |
| 16 | 195 | 1.3% | |
| 7 | 194 | 1.3% | |
| 15 | 194 | 1.3% | |
| 20 | 193 | 1.3% | |
| Other values (911) | 12820 | 85.7% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 42 | 0.3% | |
| 1 | 95 | 0.6% | |
| 2 | 118 | 0.8% | |
| 3 | 155 | 1.0% | |
| 4 | 146 | 1.0% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1255 | 1 | < 0.1% | |
| 1248 | 1 | < 0.1% | |
| 1209 | 2 | < 0.1% | |
| 1186 | 1 | < 0.1% | |
| 1164 | 1 | < 0.1% |
bluecars_returned_sum
Real number (ℝ≥0)
| Distinct count | 912 |
|---|---|
| Unique (%) | 6.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 127.3550608370103 |
|---|---|
| Minimum | 0 |
| Maximum | 1271 |
| Zeros | 17 |
| Zeros (%) | 0.1% |
| Memory size | 117.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 21 |
| median | 47 |
| Q3 | 137 |
| 95-th percentile | 534 |
| Maximum | 1271 |
| Range | 1271 |
| Interquartile range (IQR) | 116 |
Descriptive statistics
| Standard deviation | 185.4304604 |
|---|---|
| Coefficient of variation (CV) | 1.456011714 |
| Kurtosis | 5.756299107 |
| Mean | 127.3550608 |
| Median Absolute Deviation (MAD) | 34 |
| Skewness | 2.357224532 |
| Sum | 1904977 |
| Variance | 34384.45566 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 12 | 236 | 1.6% | |
| 13 | 236 | 1.6% | |
| 17 | 226 | 1.5% | |
| 11 | 222 | 1.5% | |
| 10 | 222 | 1.5% | |
| 9 | 220 | 1.5% | |
| 14 | 213 | 1.4% | |
| 18 | 198 | 1.3% | |
| 15 | 198 | 1.3% | |
| 22 | 194 | 1.3% | |
| Other values (902) | 12793 | 85.5% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 17 | 0.1% | |
| 1 | 90 | 0.6% | |
| 2 | 117 | 0.8% | |
| 3 | 142 | 0.9% | |
| 4 | 141 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1271 | 1 | < 0.1% | |
| 1230 | 1 | < 0.1% | |
| 1214 | 1 | < 0.1% | |
| 1211 | 1 | < 0.1% | |
| 1210 | 1 | < 0.1% |
utilib_taken_sum
Real number (ℝ≥0)
| Distinct count | 46 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.7297098542585907 |
|---|---|
| Minimum | 0 |
| Maximum | 47 |
| Zeros | 4972 |
| Zeros (%) | 33.2% |
| Memory size | 117.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 17 |
| Maximum | 47 |
| Range | 47 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 5.789643488 |
|---|---|
| Coefficient of variation (CV) | 1.55230399 |
| Kurtosis | 7.098757084 |
| Mean | 3.729709854 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.488247194 |
| Sum | 55789 |
| Variance | 33.51997171 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 4972 | 33.2% | |
| 1 | 2750 | 18.4% | |
| 2 | 1664 | 11.1% | |
| 3 | 1111 | 7.4% | |
| 4 | 745 | 5.0% | |
| 5 | 565 | 3.8% | |
| 6 | 432 | 2.9% | |
| 7 | 315 | 2.1% | |
| 8 | 302 | 2.0% | |
| 9 | 240 | 1.6% | |
| Other values (36) | 1862 | 12.4% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 4972 | 33.2% | |
| 1 | 2750 | 18.4% | |
| 2 | 1664 | 11.1% | |
| 3 | 1111 | 7.4% | |
| 4 | 745 | 5.0% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 47 | 1 | < 0.1% | |
| 46 | 1 | < 0.1% | |
| 45 | 1 | < 0.1% | |
| 43 | 1 | < 0.1% | |
| 42 | 1 | < 0.1% |
utilib_returned_sum
Real number (ℝ≥0)
| Distinct count | 47 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.7323840085572937 |
|---|---|
| Minimum | 0 |
| Maximum | 47 |
| Zeros | 4909 |
| Zeros (%) | 32.8% |
| Memory size | 117.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 17 |
| Maximum | 47 |
| Range | 47 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 5.797753201 |
|---|---|
| Coefficient of variation (CV) | 1.553364602 |
| Kurtosis | 7.204123791 |
| Mean | 3.732384009 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.501348695 |
| Sum | 55829 |
| Variance | 33.61394218 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 4909 | 32.8% | |
| 1 | 2798 | 18.7% | |
| 2 | 1727 | 11.5% | |
| 3 | 1060 | 7.1% | |
| 4 | 782 | 5.2% | |
| 5 | 537 | 3.6% | |
| 6 | 409 | 2.7% | |
| 7 | 348 | 2.3% | |
| 8 | 297 | 2.0% | |
| 9 | 226 | 1.5% | |
| Other values (37) | 1865 | 12.5% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 4909 | 32.8% | |
| 1 | 2798 | 18.7% | |
| 2 | 1727 | 11.5% | |
| 3 | 1060 | 7.1% | |
| 4 | 782 | 5.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 47 | 1 | < 0.1% | |
| 45 | 1 | < 0.1% | |
| 44 | 1 | < 0.1% | |
| 43 | 2 | < 0.1% | |
| 42 | 1 | < 0.1% |
utilib_14_taken_sum
Real number (ℝ≥0)
| Distinct count | 91 |
|---|---|
| Unique (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.6936087712261 |
|---|---|
| Minimum | 0 |
| Maximum | 100 |
| Zeros | 2605 |
| Zeros (%) | 17.4% |
| Memory size | 117.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 4 |
| Q3 | 10 |
| 95-th percentile | 37 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 12.85855496 |
|---|---|
| Coefficient of variation (CV) | 1.479081391 |
| Kurtosis | 6.859179587 |
| Mean | 8.693608771 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 2.467497906 |
| Sum | 130039 |
| Variance | 165.3424356 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2605 | 17.4% | |
| 1 | 2022 | 13.5% | |
| 2 | 1599 | 10.7% | |
| 3 | 1242 | 8.3% | |
| 4 | 987 | 6.6% | |
| 5 | 742 | 5.0% | |
| 6 | 634 | 4.2% | |
| 7 | 477 | 3.2% | |
| 8 | 392 | 2.6% | |
| 9 | 333 | 2.2% | |
| Other values (81) | 3925 | 26.2% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2605 | 17.4% | |
| 1 | 2022 | 13.5% | |
| 2 | 1599 | 10.7% | |
| 3 | 1242 | 8.3% | |
| 4 | 987 | 6.6% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 100 | 1 | < 0.1% | |
| 94 | 1 | < 0.1% | |
| 93 | 1 | < 0.1% | |
| 91 | 1 | < 0.1% | |
| 90 | 1 | < 0.1% |
utilib_14_returned_sum
Real number (ℝ≥0)
| Distinct count | 92 |
|---|---|
| Unique (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.69247225564915 |
|---|---|
| Minimum | 0 |
| Maximum | 96 |
| Zeros | 2568 |
| Zeros (%) | 17.2% |
| Memory size | 117.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 10 |
| 95-th percentile | 37 |
| Maximum | 96 |
| Range | 96 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 12.85770887 |
|---|---|
| Coefficient of variation (CV) | 1.479177442 |
| Kurtosis | 6.850342393 |
| Mean | 8.692472256 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 2.467541735 |
| Sum | 130022 |
| Variance | 165.3206774 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2568 | 17.2% | |
| 1 | 2041 | 13.6% | |
| 2 | 1629 | 10.9% | |
| 3 | 1293 | 8.6% | |
| 4 | 933 | 6.2% | |
| 5 | 770 | 5.1% | |
| 6 | 573 | 3.8% | |
| 7 | 479 | 3.2% | |
| 8 | 383 | 2.6% | |
| 9 | 368 | 2.5% | |
| Other values (82) | 3921 | 26.2% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2568 | 17.2% | |
| 1 | 2041 | 13.6% | |
| 2 | 1629 | 10.9% | |
| 3 | 1293 | 8.6% | |
| 4 | 933 | 6.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 96 | 1 | < 0.1% | |
| 94 | 2 | < 0.1% | |
| 93 | 1 | < 0.1% | |
| 90 | 1 | < 0.1% | |
| 89 | 4 | < 0.1% |
slots_freed_sum
Real number (ℝ≥0)
| Distinct count | 289 |
|---|---|
| Unique (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.865222623345367 |
|---|---|
| Minimum | 0 |
| Maximum | 344 |
| Zeros | 9492 |
| Zeros (%) | 63.5% |
| Memory size | 117.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 5 |
| 95-th percentile | 150 |
| Maximum | 344 |
| Range | 344 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 52.25793121 |
|---|---|
| Coefficient of variation (CV) | 2.285476598 |
| Kurtosis | 6.06424788 |
| Mean | 22.86522262 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.548047338 |
| Sum | 342018 |
| Variance | 2730.891375 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 9492 | 63.5% | |
| 1 | 495 | 3.3% | |
| 2 | 453 | 3.0% | |
| 3 | 365 | 2.4% | |
| 4 | 311 | 2.1% | |
| 5 | 224 | 1.5% | |
| 6 | 169 | 1.1% | |
| 7 | 112 | 0.7% | |
| 8 | 89 | 0.6% | |
| 9 | 74 | 0.5% | |
| Other values (279) | 3174 | 21.2% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 9492 | 63.5% | |
| 1 | 495 | 3.3% | |
| 2 | 453 | 3.0% | |
| 3 | 365 | 2.4% | |
| 4 | 311 | 2.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 344 | 1 | < 0.1% | |
| 334 | 1 | < 0.1% | |
| 330 | 1 | < 0.1% | |
| 322 | 1 | < 0.1% | |
| 319 | 3 | < 0.1% |
slots_taken_sum
Real number (ℝ≥0)
| Distinct count | 292 |
|---|---|
| Unique (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.86923385479342 |
|---|---|
| Minimum | 0 |
| Maximum | 349 |
| Zeros | 9499 |
| Zeros (%) | 63.5% |
| Memory size | 117.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 5 |
| 95-th percentile | 151 |
| Maximum | 349 |
| Range | 349 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 52.29121544 |
|---|---|
| Coefficient of variation (CV) | 2.286531144 |
| Kurtosis | 6.071606887 |
| Mean | 22.86923385 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.549361753 |
| Sum | 342078 |
| Variance | 2734.371212 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 9499 | 63.5% | |
| 1 | 494 | 3.3% | |
| 2 | 450 | 3.0% | |
| 3 | 374 | 2.5% | |
| 4 | 294 | 2.0% | |
| 5 | 223 | 1.5% | |
| 6 | 164 | 1.1% | |
| 7 | 131 | 0.9% | |
| 8 | 91 | 0.6% | |
| 9 | 66 | 0.4% | |
| Other values (282) | 3172 | 21.2% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 9499 | 63.5% | |
| 1 | 494 | 3.3% | |
| 2 | 450 | 3.0% | |
| 3 | 374 | 2.5% | |
| 4 | 294 | 2.0% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 349 | 1 | < 0.1% | |
| 330 | 1 | < 0.1% | |
| 328 | 1 | < 0.1% | |
| 326 | 1 | < 0.1% | |
| 322 | 1 | < 0.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
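The calculation described above can be sketched in plain Python. This is an illustrative implementation, not the code the report generator uses; the function name `pearson_r` is hypothetical, and the sample values are the bluecars_taken_sum and bluecars_returned_sum columns of the ten "First rows" shown in this report.

```python
import math

def pearson_r(xs, ys):
    """Pearson's r: covariance of xs and ys over the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    # The 1/n (or 1/(n-1)) normalisation cancels between numerator and denominator.
    return cov / math.sqrt(ss_x * ss_y)

# Ten sample rows only; the full-data coefficient may differ.
taken = [110, 98, 138, 114, 187, 180, 84, 81, 88, 125]
returned = [103, 94, 139, 117, 185, 180, 83, 84, 85, 125]

r = pearson_r(taken, returned)
# Shifting and rescaling one variable (positive scale) leaves r unchanged.
r_rescaled = pearson_r([2 * x + 5 for x in taken], returned)
print(round(r, 3), round(r_rescaled, 3))
```

On these ten rows the two columns are almost perfectly linearly related, which is in line with the high-correlation warning reported for this pair.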
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
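A minimal sketch of this rank-based calculation follows. It assumes ties receive the average of their rank positions, which is a common convention but not something the text above specifies; the helper names are hypothetical.

```python
import math

def average_ranks(values):
    """Assign 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(xs, ys):
    """Covariance over the product of standard deviations (normalisation cancels)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return cov / math.sqrt(sum((x - mx) ** 2 for x in xs) * sum((y - my) ** 2 for y in ys))

def spearman_rho(xs, ys):
    """Spearman's rho: Pearson's r applied to the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))

# A monotonic but nonlinear relationship (y = x**3), made up for illustration.
xs = [1, 2, 3, 4, 5]
ys = [x ** 3 for x in xs]
print(spearman_rho(xs, ys), pearson_r(xs, ys))
```

Because the ranks of the two lists are identical, ρ reaches its maximum while Pearson's r stays below 1.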
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the difference between the number of concordant pairs and the number of discordant pairs, divided by the total number of pairs.
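The pair-counting definition above can be written out directly. The sketch below implements exactly that formula (often called tau-a); tied pairs count as neither concordant nor discordant. Library implementations commonly apply a tie correction (for example the tau-b variant), so their results can differ when ties are present. The function name is hypothetical.

```python
from itertools import combinations

def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total pairs, as described in the text above."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1   # both variables move in the same direction
        elif sign < 0:
            discordant += 1   # the variables move in opposite directions
    n = len(xs)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

# Made-up example: one swapped pair out of six gives 5 concordant, 1 discordant.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
print(tau)
```

This brute-force version examines every pair, so it is quadratic in the number of observations and suited only to small samples.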
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available for Phik.
First rows
| | df_index | postal_code | date | daily_data_points | dayofweek | day_type | bluecars_taken_sum | bluecars_returned_sum | utilib_taken_sum | utilib_returned_sum | utilib_14_taken_sum | utilib_14_returned_sum | slots_freed_sum | slots_taken_sum |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 75001 | 2018-01-01 | 1440 | 0 | weekday | 110 | 103 | 3 | 2 | 10 | 9 | 22 | 20 |
| 1 | 1 | 75001 | 2018-01-02 | 1438 | 1 | weekday | 98 | 94 | 1 | 1 | 8 | 8 | 23 | 22 |
| 2 | 2 | 75001 | 2018-01-03 | 1439 | 2 | weekday | 138 | 139 | 0 | 0 | 2 | 2 | 27 | 27 |
| 3 | 4 | 75001 | 2018-01-05 | 1440 | 4 | weekday | 114 | 117 | 3 | 3 | 6 | 6 | 18 | 20 |
| 4 | 5 | 75001 | 2018-01-06 | 1437 | 5 | weekend | 187 | 185 | 6 | 6 | 7 | 8 | 38 | 35 |
| 5 | 6 | 75001 | 2018-01-07 | 1440 | 6 | weekend | 180 | 180 | 2 | 2 | 10 | 9 | 34 | 34 |
| 6 | 7 | 75001 | 2018-01-08 | 1438 | 0 | weekday | 84 | 83 | 3 | 3 | 10 | 10 | 14 | 15 |
| 7 | 8 | 75001 | 2018-01-09 | 1439 | 1 | weekday | 81 | 84 | 1 | 1 | 4 | 4 | 15 | 15 |
| 8 | 9 | 75001 | 2018-01-10 | 1440 | 2 | weekday | 88 | 85 | 5 | 5 | 11 | 11 | 23 | 22 |
| 9 | 10 | 75001 | 2018-01-11 | 1440 | 3 | weekday | 125 | 125 | 3 | 4 | 13 | 13 | 22 | 22 |
Last rows
| | df_index | postal_code | date | daily_data_points | dayofweek | day_type | bluecars_taken_sum | bluecars_returned_sum | utilib_taken_sum | utilib_returned_sum | utilib_14_taken_sum | utilib_14_returned_sum | slots_freed_sum | slots_taken_sum |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 14948 | 16074 | 95880 | 2018-06-09 | 1440 | 5 | weekend | 15 | 15 | 0 | 0 | 1 | 2 | 0 | 0 |
| 14949 | 16075 | 95880 | 2018-06-10 | 1440 | 6 | weekend | 34 | 32 | 0 | 0 | 1 | 0 | 0 | 0 |
| 14950 | 16076 | 95880 | 2018-06-11 | 1440 | 0 | weekday | 17 | 18 | 0 | 0 | 0 | 0 | 0 | 0 |
| 14951 | 16077 | 95880 | 2018-06-12 | 1439 | 1 | weekday | 25 | 25 | 0 | 0 | 0 | 0 | 0 | 0 |
| 14952 | 16078 | 95880 | 2018-06-13 | 1440 | 2 | weekday | 12 | 13 | 0 | 0 | 1 | 1 | 0 | 0 |
| 14953 | 16079 | 95880 | 2018-06-14 | 1439 | 3 | weekday | 15 | 13 | 0 | 0 | 0 | 0 | 0 | 0 |
| 14954 | 16080 | 95880 | 2018-06-15 | 1440 | 4 | weekday | 15 | 10 | 0 | 0 | 2 | 3 | 0 | 0 |
| 14955 | 16081 | 95880 | 2018-06-16 | 1440 | 5 | weekend | 19 | 19 | 0 | 0 | 2 | 1 | 0 | 0 |
| 14956 | 16082 | 95880 | 2018-06-17 | 1440 | 6 | weekend | 33 | 35 | 1 | 1 | 0 | 0 | 0 | 0 |
| 14957 | 16083 | 95880 | 2018-06-18 | 1440 | 0 | weekday | 11 | 14 | 3 | 5 | 2 | 2 | 0 | 0 |